Home

Home

20.5.1 메모리 관리 혁신: PagedAttention과 vLLM

Home / 인공지능 (Artificial Intelligence, AI) / 제목: Embodied AI & Modern Control / Chapter 20. 파운데이션 모델의 경량화와 엣지 배포 (Efficient Deployment) / 20.5 추론 가속화와 런타임 최적화 (Inference Acceleration & Runtime Optimization) / 20.5.1 메모리 관리 혁신: PagedAttention과 vLLM

20.5.1 메모리 관리 혁신: PagedAttention과 vLLM

Generated by Rust Site Gen